智能论文笔记

Interventions, Where and How? Experimental Design for Causal Models at Scale

Panagiotis Tigas , Yashas Annadani , Andrew Jesson , Bernhard Schölkopf , Yarin Gal , Stefan Bauer

分类：机器学习 | 人工智能 | (统计)机器学习

2022-03-03

由于数据有限和非识别性，观察性和介入数据的因果发现是具有挑战性的：在估计基本结构因果模型（SCM）时引入不确定性的因素。基于这两个因素引起的不确定性选择实验（干预措施）可以加快SCM的识别。来自有限数据的因果发现实验设计中的现有方法要么依赖于SCM的线性假设，要么仅选择干预目标。这项工作将贝叶斯因果发现的最新进展纳入了贝叶斯最佳实验设计框架中，从而使大型非线性SCM的积极因果发现同时选择了介入目标和值。我们证明了对线性和非线性SCM的合成图（ERDOS-R \'enyi，breetr cable）以及在\ emph {intiLico}单细胞基因调节网络数据集的\ emph {inyeare scms的性能。

translated by 谷歌翻译

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

Andrew Jesson , Panagiotis Tigas , Joost van Amersfoort , Andreas Kirsch , Uri Shalit , Yarin Gal

分类：机器学习 | (统计)机器学习

2021-11-03

估算高维观测数据的个性化治疗效果在实验设计不可行，不道德或昂贵的情况下是必不可少的。现有方法依赖于拟合对治疗和控制人群的结果的深层模型。然而，当测量单独的结果是昂贵的时，就像肿瘤活检一样，需要一种用于获取每种结果的样本有效的策略。深度贝叶斯主动学习通过选择具有高不确定性的点来提供高效数据采集的框架。然而，现有方法偏置训练数据获取对处理和控制群体之间的非重叠支持区域。这些不是样本效率，因为在这些区域中不可识别治疗效果。我们介绍了因果关系，贝叶斯采集函数接地的信息理论，使数据采集朝向具有重叠支持的地区，以最大限度地提高学习个性化治疗效果的采样效率。我们展示了拟议的综合和半合成数据集IHDP和CMNIST上提出的收购策略及其扩展的表现，旨在模拟常见的数据集偏差和病理学。

translated by 谷歌翻译

Seafloor-Invariant Caustics Removal from Underwater Imagery

Panagiotis Agrafiotis , Konstantinos Karantzalos , Andreas Georgopoulos

分类：计算机视觉

2022-12-20

Mapping the seafloor with underwater imaging cameras is of significant importance for various applications including marine engineering, geology, geomorphology, archaeology and biology. For shallow waters, among the underwater imaging challenges, caustics i.e., the complex physical phenomena resulting from the projection of light rays being refracted by the wavy surface, is likely the most crucial one. Caustics is the main factor during underwater imaging campaigns that massively degrade image quality and affect severely any 2D mosaicking or 3D reconstruction of the seabed. In this work, we propose a novel method for correcting the radiometric effects of caustics on shallow underwater imagery. Contrary to the state-of-the-art, the developed method can handle seabed and riverbed of any anaglyph, correcting the images using real pixel information, thus, improving image matching and 3D reconstruction processes. In particular, the developed method employs deep learning architectures in order to classify image pixels to "non-caustics" and "caustics". Then, exploits the 3D geometry of the scene to achieve a pixel-wise correction, by transferring appropriate color values between the overlapping underwater images. Moreover, to fill the current gap, we have collected, annotated and structured a real-world caustic dataset, namely R-CAUSTIC, which is openly available. Overall, based on the experimental results and validation the developed methodology is quite promising in both detecting caustics and reconstructing their intensity.

translated by 谷歌翻译

FNDaaS: Content-agnostic Detection of Fake News sites

Panagiotis Papadopoulos , Dimitris Spithouris , Evangelos P. Markatos , Nicolas Kourtellis

分类：机器学习

2022-12-13

Automatic fake news detection is a challenging problem in misinformation spreading, and it has tremendous real-world political and social impacts. Past studies have proposed machine learning-based methods for detecting such fake news, focusing on different properties of the published news articles, such as linguistic characteristics of the actual content, which however have limitations due to the apparent language barriers. Departing from such efforts, we propose FNDaaS, the first automatic, content-agnostic fake news detection method, that considers new and unstudied features such as network and structural characteristics per news website. This method can be enforced as-a-Service, either at the ISP-side for easier scalability and maintenance, or user-side for better end-user privacy. We demonstrate the efficacy of our method using data crawled from existing lists of 637 fake and 1183 real news websites, and by building and testing a proof of concept system that materializes our proposal. Our analysis of data collected from these websites shows that the vast majority of fake news domains are very young and appear to have lower time periods of an IP associated with their domain than real news ones. By conducting various experiments with machine learning classifiers, we demonstrate that FNDaaS can achieve an AUC score of up to 0.967 on past sites, and up to 77-92% accuracy on newly-flagged ones.

translated by 谷歌翻译

Probabilistic Verification of ReLU Neural Networks via Characteristic Functions

Joshua Pilipovsky , Vignesh Sivaramakrishnan , Meeko M. K. Oishi , Panagiotis Tsiotras

分类：机器学习

2022-12-03

Verifying the input-output relationships of a neural network so as to achieve some desired performance specification is a difficult, yet important, problem due to the growing ubiquity of neural nets in many engineering applications. We use ideas from probability theory in the frequency domain to provide probabilistic verification guarantees for ReLU neural networks. Specifically, we interpret a (deep) feedforward neural network as a discrete dynamical system over a finite horizon that shapes distributions of initial states, and use characteristic functions to propagate the distribution of the input data through the network. Using the inverse Fourier transform, we obtain the corresponding cumulative distribution function of the output set, which can be used to check if the network is performing as expected given any random point from the input set. The proposed approach does not require distributions to have well-defined moments or moment generating functions. We demonstrate our proposed approach on two examples, and compare its performance to related approaches.

translated by 谷歌翻译

AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body -- Theory and Experiments

Mehregan Dor , Travis Driver , Kenneth Getzandanner , Panagiotis Tsiotras

分类：机器人 | 计算机视觉

2022-12-01

We propose AstroSLAM, a standalone vision-based solution for autonomous online navigation around an unknown target small celestial body. AstroSLAM is predicated on the formulation of the SLAM problem as an incrementally growing factor graph, facilitated by the use of the GTSAM library and the iSAM2 engine. By combining sensor fusion with orbital motion priors, we achieve improved performance over a baseline SLAM solution. We incorporate orbital motion constraints into the factor graph by devising a novel relative dynamics factor, which links the relative pose of the spacecraft to the problem of predicting trajectories stemming from the motion of the spacecraft in the vicinity of the small body. We demonstrate the excellent performance of AstroSLAM using both real legacy mission imagery and trajectory data courtesy of NASA's Planetary Data System, as well as real in-lab imagery data generated on a 3 degree-of-freedom spacecraft simulator test-bed.

translated by 谷歌翻译

Reproducibility in medical image radiomic studies: contribution of dynamic histogram binning

Darryl E. Wright , Cole Cook , Jason Klug , Panagiotis Korfiatis , Timothy L. Kline

分类：计算机视觉

2022-11-09

The de facto standard of dynamic histogram binning for radiomic feature extraction leads to an elevated sensitivity to fluctuations in annotated regions. This may impact the majority of radiomic studies published recently and contribute to issues regarding poor reproducibility of radiomic-based machine learning that has led to significant efforts for data harmonization; however, we believe the issues highlighted here are comparatively neglected, but often remedied by choosing static binning. The field of radiomics has improved through the development of community standards and open-source libraries such as PyRadiomics. But differences in image acquisition, systematic differences between observers' annotations, and preprocessing steps still pose challenges. These can change the distribution of voxels altering extracted features and can be exacerbated with dynamic binning.

translated by 谷歌翻译

Risk-Aware Model Predictive Path Integral Control Using Conditional Value-at-Risk

Ji Yin , Zhiyuan Zhang , Panagiotis Tsiotras

分类：机器人

2022-09-26

在本文中，我们为自主机器人提供了一种新型的模型预测控制方法，受到任意形式的不确定性。拟议的风险感知模型预测路径积分（RA-MPPI）控制利用条件价值（CVAR）度量来为安全关键的机器人应用生成最佳控制动作。与大多数现有的随机MPC和CVAR优化方法不同，这些方法将原始动力学线性化并将控制任务制定为凸面程序，而拟议的方法直接使用原始动力学，而无需限制成本函数或噪声的形式。我们将新颖的RA-MPPI控制器应用于自动驾驶汽车，以在混乱的环境中进行积极的驾驶操作。我们的仿真和实验表明，与基线MPPI控制器相比，提出的RA-MPPI控制器可以达到大约相同的圈时间，而碰撞的碰撞明显少得多。所提出的控制器以高达80Hz的更新频率执行在线计算，利用现代图形处理单元（GPU）来进行多线程轨迹以及CVAR值的生成。

translated by 谷歌翻译

MLGWSC-1: The first Machine Learning Gravitational-Wave Search Mock Data Challenge

Marlin B. Schäfer , Ondřej Zelenka , Alexander H. Nitz , He Wang , Shichao Wu , Zong-Kuan Guo , Zhoujian Cao , Zhixiang Ren , Paraskevi Nousi , Nikolaos Stergioulas

分类：机器学习

2022-09-22

我们介绍了第一个机器学习引力波搜索模拟数据挑战（MLGWSC-1）的结果。在这一挑战中，参与的小组必须从二进制黑洞合并中识别出复杂性和持续时间逐渐嵌入在逐渐更现实的噪声中的引力波信号。 4个提供的数据集中的决赛包含O3A观察的真实噪声，并发出了20秒的持续时间，其中包含进动效应和高阶模式。我们介绍了在提交前从参与者未知的1个月的测试数据中得出的6个输入算法的平均灵敏度距离和运行时。其中4个是机器学习算法。我们发现，最好的基于机器学习的算法能够以每月1个的错误警报率（FAR）的速度（FAR）实现基于匹配过滤的生产分析的敏感距离的95％。相反，对于真实的噪音，领先的机器学习搜索获得了70％。为了更高的范围，敏感距离缩小的差异缩小到某些数据集上选择机器学习提交的范围$ \ geq 200 $以优于传统搜索算法的程度。我们的结果表明，当前的机器学习搜索算法可能已经在有限的参数区域中对某些生产设置有用。为了改善最新的技术，机器学习算法需要降低他们能够检测信号并将其有效性扩展到参数空间区域的虚假警报率，在这些区域中，建模的搜索在计算上很昂贵。根据我们的发现，我们汇编了我们认为，将机器学习搜索提升到重力波信号检测中的宝贵工具，我们认为这是最重要的研究领域。

translated by 谷歌翻译

Information-theoretic Abstraction of Semantic Octree Models for Integrated Perception and Planning

Daniel T. Larsson , Arash Asgharivaskasi , Jaein Lim , Nikolay Atanasov , Panagiotis Tsiotras

分类：机器人

2022-09-20

在本文中，我们开发了一种方法，该方法使自主机器人能够从点云数据构建和压缩语义环境表示。我们的方法从传感器数据中构建了环境的三维语义树表示，然后通过一种新型的信息理论树木修复方法来压缩。所提出的方法是概率的，并将其纳入现实世界中固有的语义分类中。此外，我们的方法允许机器人在生成压缩树时优先考虑单个语义类，以设计保留相关语义信息的多分辨率表示，同时丢弃不需要的语义类别。我们通过压缩大型户外，语义丰富，真实世界环境的语义OCTREE模型来演示方法。此外，我们还展示了如何使用OCTREE抽象来创建语义信息图以进行运动计划，并使用未知的图形构造方法（例如Halton序列）进行比较。

translated by 谷歌翻译